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INTRODUCTION: 

tRNAs  read  mRNA  codons  in  translation  and  are  essential  for  protein  synthesis.  Like 
mRNA,  the  concentration  and  the  identity  of  tRNAs  are  also  under  stringent  cellular  control.  Our 
project  aims  to  determine  whether  tRNAs  serve  as  biomarkers  for  breast  cancer  cells  and  whether 
tRNAs  in  normal  and  breast  cancer  cell  lines  control  the  expression  of  selected  genes  that  are 
important  for  the  tumorigenic  process.  Finding  tRNA  as  biomarkers  could  open  up  a  new  and  so  far 
under- appreciated  avenue  for  detecting  the  type  and  stage  of  breast  cancer  progression. 
Identification  of  tRNAs  that  regulate  tumor  development  and  progression  could  produce  targets  for 
a  new  class  of  cancer  drugs. 

BODY  (Tables  and  Figures  at  the  end  of  the  text): 

Task  1:  Development  of  a  “second-generation”  tRNA  microarray  capable  of  studying 
single -nucleotide  differences  in  human  tRNA.  ->  Completed  when  the  last  annual  report  was 
written  in  August  2008. 

Task  2:  Development  of  a  “third-generation”  method  capable  of  studying  the  expression  of  >90% 
of  human  tRNA  species.  ->  In  development. 

After  the  successful  development  of  the  “second-generation”  microarray  method  (task  1), 
we  realized  that  the  most  effective  way  for  completing  task  2  is  to  apply  the  newly  available,  high 
throughput  DNA  sequencing  method  (e.g.  Solexa  technology  by  Illumina).  This  new  technology 
offers  a  much  higher  data  output  at  a  significantly  reduced  cost.  We  are  continuing  to  work  on  this. 

Task  3:  Evaluate  the  usefulness  of  tRNA  expression  pattern  as  biomarkers  Completed. 

To  explore  the  potential  of  tRNAs  to  define  breast  cancer  signatures,  we  used  tRNA 
microarrays  to  generate  comparative  tRNA  profiles  for  three  non-tumorigenic  breast  epithelial  cell 
lines  (MCF10A,  184  Al,  184  B5)  and  six  breast  cancer  cell  lines  (BT-474,  HCC70,  MCF7,  MDA- 
MB-231,  MDA-MB-436,  ZR-75-1).  The  breast  cancer  cell  lines  cover  a  range  of  physiological  and 
molecular  properties  (Table  1).  Importantly,  genome-wide  mRNA  expression  data  are  available  for 
these  cell  lines  (1)  to  allow  tRNA  expression  and  codon  usage  correlation  analysis. 

Significant  differences  are  observed  in  the  expression  levels  of  tRNA  among  non- 
tumorigenic  and  tumorigenic  breast  cell  lines  (Fig.  1A,  1C).  The  global  level  of  nuclear-  and 
mitochondrial-encoded  tRNAs  can  be  approximated  separately  by  the  median  and  mean  sample-to- 
MCF10A  ratio.  For  nuclear-encoded  tRNAs,  this  ratio  is  0.7-0. 8  for  the  other  non-tumorigenic  cell 
lines  and  2-3  for  the  tumorigeniccell  lines.  These  differences  are  even  more  pronounced  for 
mitochondrial-encoded  tRNA  levels:  the  sample-to-MCFlOA  ratio  for  non-tumorigenic  cell  lines  is 
approximately  0.75,  but  as  much  as  5  for  the  tumorigenic  cells. 

The  over-expression  of  tRNA  in  tumorigenic  relative  to  non-tumorigenic  cell  lines  is  also 
selective:  certain  individual  tRNAs  are  more  strongly  over-expressed  than  others  (Fig.  1C). 
Variations  in  the  relative  expression  of  nuclear-encoded  tRNAs  carrying  certain  amino  acid  types 
are  readily  observed  across  tumorigenic  cell  lines.  For  example,  nuclear-encoded  tRNAs  carrying 
polar  amino  acids  (e.g.  Ser,  Thr,  and  Tyr)  are  up-regulated  3  to  4-fold  in  breast  cancer  cell  lines 
relative  to  MCF10A,  while  nuclear-encoded  tRNAs  carrying  small  amino  acids  (Ala,  Cys,  Gly,  Pro) 
are  up-regulated  only  1.5  to  2-fold.  These  differences  can  be  observed  more  clearly  when  the  tRNA 
expression  is  normalized  to  the  median  value  for  either  nuclear-  or  mitochondrial-encoded  tRNAs 
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within  each  cell  line  (Fig.  1C,  ID).  Differences  in  the  relative  expression  of  tRNA  isoacceptors  also 
become  more  apparent  after  normalization.  Selective  up-regulation  of  tRNA  levels  is  also  observed 
for  mitochondrial-encoded  tRNAs:  certain  tRNAs  are  expressed  up  to  2-fold  above  median,  and 
others  are  expressed  4-fold  below  median.  While  nuclear-encoded  tRNA  expression  patterns  are 
remarkably  similar  across  cell  lines,  mitochondrial-encoded  tRNA  expression  patterns  exhibit 
greater  variations. 

We  also  detennined  that  tRNA  over-expression  in  breast  cancer  cell  lines  does  not  simply 
reflect  an  increase  in  proliferation  rate  (Table  2).  Under  the  culture  conditions  used  in  this  study, 
doubling  times  range  from  21  to  47  hours  for  the  non-tumorigenic  cell  lines,  versus  15  to  35  hours 
for  the  tumorigenic  cell  lines.  Thus,  the  proliferation  of  all  three  non-tumorigenic  cell  lines  with  low 
tRNA  content  is  comparable  to  that  of  tumorigenic  breast  cancer  cell  lines  with  much  higher  tRNA 
content.  Doubling  times  do  not  correlate  with  either  nuclear  or  mitochondrial  global  tRNA  levels 
(not  shown). 

tRNA  over-expression  in  patient-derived  breast  tumor  samples:  In  all  breast  cancer  cell  lines 
examined,  we  observe  global  tRNA  over-expression  and  differential  tRNA  isoacceptor  expression. 
To  generalize  our  results  in  cell  lines  to  breast  cancer  in  vivo,  we  measured  tRNA  expression 
patterns  in  9  patient-derived  breast  tumor  samples  and  3  nonnal  breast  tissue  samples  (Table  3).  The 
9  breast  tumor  samples  were  selected  from  the  three  major  subtypes  of  breast  cancer:  luminals 
(ER+,  HER2-),  basals  (ER-,  HER2-),  and  HER2+.  For  consistency  of  data  analysis,  all  samples 
were  run  using  MCF10A  as  a  reference  in  array  experiments. 

As  in  the  breast  cancer  cell  lines,  we  observe  significant  differences  in  global  tRNA 
expression  levels  among  breast  tumor  and  normal  breast  samples  (Fig.  2A,  2B).  For  nuclear- 
encoded  tRNAs,  the  mean  sample-to-MCFlOA  ratios  is  0.2  to  0.5  for  normal  breast  tissue  samples, 
compared  to  2  to  4  for  breast  tumor  samples.  This  translates  to  up  to  20-fold  upregulation  of  global 
nuclear-encoded  tRNA  levels  in  breast  tumors  relative  to  nonnal  breast.  For  mitochondrial-encoded 
tRNAs,  the  mean  sample-to-MCFlOA  ratio  is  0.4  to  1  for  normal  breast  tissue  samples,  compared  to 
1.2  to  5  for  breast  tumor  samples.  This  translates  to  up  to  13-fold  upregulation  of  global 
mitochondrial-encoded  tRNA  levels  in  breast  tumors  relative  to  normal  breast. 

Consistent  with  our  results  on  cell  lines,  tRNA  over-expresssion  in  breast  tumor  samples  is 
also  selective:  certain  individual  tRNAs  are  more  strongly  over-expressed  than  others  (Fig.  2C). 
Among  the  top  10  over-expressed,  nuclear-encoded  tRNAs,  six  tRNA  species  overlap  for  cell  lines 
and  tumor  tissues.  We  readily  observe  variations  in  the  relative  expression  of  both  nuclear-  and 
mitochondrial-encoded  tRNAs.  These  variations  correlate  with  both  the  cognate  amino  acid  type 
and  tRNA  isoacceptor  identity.  We  also  observe  variations  in  the  relative  expression  of  tRNA 
isoacceptors.  These  variations  become  more  clear  when  tRNA  expression  levels  are  normalized  to 
the  median  value  within  each  sample  (Fig.  2D).  However,  tRNA  expression  pattern  in  the  three 
major  sub-types  of  breast  cancer  is  very  similar,  suggesting  that  tRNA  over-expression  is  a  general 
consequence  for  all  breast  tumors. 

We  conclude  that  elevated  tRNA  levels  are  a  hallmark  of  breast  cancer.  Our  results  suggest 
that  tRNAs  can  be  used  as  biomarkers  for  breast  cancer,  although  it  is  best  suitable  to  determine 
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tumorigenic  versus  normal  breast  cells.  Furthermore,  tRNA  expression  measured  by  microarrays 
more  accurately  reflects  the  “functional  abundance”  of  individual  tRNA  species. 

Task  4:  Identify  correlations  between  tRNA  and  mRNA  expression:  ->  Completed. 

The  differential  expression  of  tRNA  isoacceptors  can  be  used  to  regulate  translational 
efficiency  via  the  codon  usage  of  specific  genes.  In  prokaryotes  and  fungi,  differences  in  the 
abundance  of  tRNA  isoacceptors  are  correlated  with  codon  preferences  of  genes  encoding  highly 
expressed  proteins  and  impact  the  synthesis  of  these  proteins  (2-4).  In  humans,  tissue-specific 
differences  in  the  expression  of  individual  tRNA  species  can  correlate  to  the  codon  usage  of  highly- 
expressed,  tissue-specific  genes,  although  this  correlation  was  seen  only  for  a  small  number  of 
tissues  (5). 

tRNA  isoacceptor  levels  in  breast  cancer  cell  lines  may  also  correlate  with  the  codon  usage 
of  certain  genes  that  are  important  for  cancer.  Finding  such  correlations  would  suggest  an  additional 
level  of  translational  regulation  for  breast  cancer  cell  lines,  and  by  extension  for  breast  cancer  in 
vivo.  At  least  three  groups  of  genes  are  relevant  in  seeking  codon  usage-tRNA  correlation  for 
cancer  cell  lines:  cell-line  specific  genes,  cancer-related  genes  and  house-keeping  genes  (Fig.  3A). 
Cell-line  specific  genes  can  be  important  for  distinct  tumorigenic  properties  across  cell  lines; 
cancer-related  genes  can  be  important  for  general  tumor  initiation  and  progression;  and  house¬ 
keeping  genes  are  important  for  cell  growth  and  architecture. 

To  identify  cell  line-specific  genes,  we  used  the  publicly  available  mRNA  expression  data 
for  the  breast  cancer  lines  and  MCF10A  (http://www.ebi.ac.uk/arrayexpress/,  accession  number  E- 
TABM-157).  To  identify  cell  line-specific  genes,  we  used  publicly  available  mRNA  expression  data 
((http://www.ebi.ac.uk/arrayexpress/,  accession  number  E-TABM-157).  Cell  line-specific  genes 
were  selected  based  on  mRNA  expression  level  (7-  to  15-fold  above  the  median  expression  level 
detennined  for  all  genes)  and  high  cell  line/MCFlOA  expression  ratios  (top  20  to  30  genes).  A  cell 
line-specific  gene  set  was  detennined  for  each  breast  cancer  cell  line.  To  identify  cancer-related 
genes,  we  selected  from  a  comprehensive  list  of  potential  breast  cancer  diagnostic  markers 
(http://www.sabiosciences.com/gene_array_product/HTML/OHS-402.html).  Genes  in  this  group 
are  highly  associated  with  breast  cancer  (6).  Functional  groupings  used  in  our  study  include:  cell 
cycle,  cell  growth  and  proliferation,  ECM  molecules,  protein  kinases,  and  transcription 
factors/regulators.  To  identify  house-keeping  genes,  we  selected  the  30  most  highly  expressed 
house-keeping  genes  as  defined  in  a  previous  report  (6).  This  list  includes  ribosomal  proteins,  actin, 
ubiquitin,  and  others. 

For  each  gene  set,  we  compiled  gene  sequences  and  analyzed  them  for  codon  usage 
(expressed  as  number  per  one  thousand  codons,  www.bioinfonnatics.org/sms2)  (Tables  4  and  5). 
Because  certain  tRNAs  read  more  than  one  codon,  we  converted  the  obtained  codon  usage  into 
tRNA-based  codon  usage.  For  example,  tRNAArg(ItG)  reads  both  CGU  and  CGC.  Its  tRNA-based 
codon  usage  is  therefore  equal  to  the  sum  of  the  CGU  and  CGC  codon  usages.  For  simplicity,  we 
refer  to  tRNA-based  codon  usage  as  codon  usage  throughout  our  analysis. 

We  first  compared  the  codon  usages  of  each  gene  set  in  the  three  gene  groups  (cell  line- 
specific,  cancer-related,  and  housekeeping)  against  each  other  (Fig.  3B).  Though  there  is  limited 
overlap  across  cell  line-specific  gene  sets,  their  codon  usage  was  remarkably  similar  (average  rs  = 


3 


0.92  ±  0.04).  The  codon  usage  of  cell-line  specific  gene  sets  also  correlates  with  the  codon  usage  of 
the  housekeeping  genes  (average  rs  =  0.80  ±  0.08),  but  to  a  significantly  lesser  extent  (p-value 
<0.01).  Similarly,  the  codon  usages  of  cancer-related  gene  sets  correlate  remarkably  welll  with  each 
other  (average  rs  =  0.92  ±  0.05)  but  to  a  lesser  extent  with  the  housekeeping  genes  (average  rs  =  0.83 
±  0.06,  p-value  <0.01).  These  results  suggest  functionally  different  gene  groups  have  significantly 
different  codon  usages. 

Is  the  codon  usage  of  cell-line  specific  genes  related  to  the  over-expression  patterns  of 
nuclear-encoded  tRNAs  in  breast  cancer  cells?  We  plotted  relative  isoacceptor  levels  (derived  from 
comparative  tRNA  measurements)  versus  the  codon  frequency  of  cell-line  specific  genes  (derived 
from  comparative  mRNA  analysis)  to  detennine  whether  the  changes  in  tRNA  levels  favor  the 
codon  usage  of  these  genes.  No  obvious  correlations  were  observed  (not  shown).  The  absence  of 
correlations  may  be  explained  by  the  low  mRNA  level  of  these  genes  at  the  global  scale  (Fig.  3A). 
The  cell  line-specific  genes  identified  for  this  study  are  expressed  only  7-  to  15 -fold  above  the 
median  expression  level  of  all  genes,  compared  to  approximately  80-fold  above  median  for  the 
housekeeping  genes.  Among  human  tissues  examined,  a  significant  tRNA  abundance-codon  usage 
correlation  was  found  only  in  liver  (5).  The  mRNA  levels  of  liver-specific  genes  approach  those  of 
house-keeping  genes  (200-fold  above  median),  sufficiently  high  for  tRNAs  to  adjust  to  their 
respective  codon  usages.  Because  the  codon  usage  of  line-specific  genes  is  different  from  that  of 
house-keeping  genes,  adjusting  tRNA  levels  to  favor  expression  of  cell  line-specific  genes  would  be 
unfavorable  for  translation  of  house-keeping  genes. 

The  codon  usage  of  cancer-related  genes,  however,  appears  to  have  some  correlations  to 
relative  tRNA  over-expression  in  cancer  cell  lines  (Fig.  3C-E).  Since  the  tRNA  over-expression 
pattern  is  similar  for  the  nuclear-encoded  tRNAs  across  all  lines  examined,  we  used  the  average 
tRNA  over-expression  for  all  6  cancer  lines  for  this  analysis.  Bearing  in  mind  that  tRNA  over¬ 
expression  cannot  favor  all  codons  of  cancer-related  genes  because  such  tRNA  adjustment  would 
diminish  the  translational  efficiency  of  house-keeping  genes,  we  reasoned  that  a  positive  correlation 
should  reveal  itself  only  for  codons  that  are  strongly  over-represented  in  cancer-related  relative  to 
house-keeping  genes.  A  positive  correlation  between  tRNA  over-expression  and  codon  usage  is 
indeed  observed  for  codons  over-represented  by  2-fold  or  more  for  the  cell  cycle,  extracellular 
matrix,  and  transcription  factor  groups  (Fig.  3C).  Among  the  group  of  28  cell  cycle  genes  ranked  by 
their  average  mRNA  expression  levels  in  all  lines,  a  similar  correlation  can  be  seen  when 
comparing  the  codons  that  are  over-represented  in  the  9  genes  with  the  highest  mRNA  expression 
versus  the  9  genes  with  the  lowest  mRNA  expression  (Fig.  3D).  Finally,  tRNAArg  isoacceptors  seem 
to  be  particularly  tuned  to  increase  the  translational  efficiency  of  Arg-codons  of  the  cell  cycle, 
extracellular  matrix  and  transcription  factor  genes  (Fig.  3E). 


Task  5:  Identify  correlations  between  tRNA  and  active  protein  synthesis:  In  development. 

Based  on  the  results  from  Task  4,  we  have  selected  several  gene  targets  for  protein 
expression  studies.  We  are  still  working  on  refining  the  methodology  for  this  study. 


KEY  RESEARCH  ACCOMPLISHMENTS: 
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Determined  the  feasibility  of  using  tRNA  expression  as  possible  biomarkers  for  breast 
cancer  cell  lines. 

Detennined  the  feasibility  of  using  tRNA  expression  as  possible  biomarkers  for  tumor 
versus  nonnal  breast  tissues. 

Detennined  that  tRNA  expression  has  some  conelation  to  optimal  translation  of  breast 
cancer  related  genes. 

REPORTABLE  OUTCOMES: 

A  revised  manuscript  is  now  under  review  in  Nucleic  Acids  Research.  Title:  tRNA  Over¬ 
expression  in  Breast  Cancer  and  Functional  Consequences.  Authors:  Mariana  Pavon- 
Eternod,  Suzanna  Gomes,  Renaud  Geslain,  Qing  Dai,  Marsha  Rich  Rosner,  Tao  Pan. 


CONCLUSION: 

Our  results  show  for  the  first  time  that  tRNAs  are  good  candidates  as  molecular  biomarkers 
for  breast  cancer  cells.  This  conclusion,  initially  established  in  breast  cancer  cell  lines,  has  been 
firmly  established  in  tumor  and  nonnal  breast  tissues.  Both  nuclear  and  mitochondrial-encoded 
tRNAs  show  distinct  patterns  in  different  breast  cancer  cells  with  varying  tumorigenic 
characteristics. 

Another  important  aspect  of  our  tRNA  study  is  to  discover  the  possibility  that  altering  tRNA 
expression  in  breast  cancer  may  lead  to  changes  in  cellular  behavior.  We  have  found  correlations 
between  tRNA  overexpression  and  the  codon  usage  of  breast  cancer  related  genes.  Altering  this 
relationship  may  lead  to  the  identification  of  certain  tRNAs  or  their  associated  protein  enzymes  as 
potential  new  drug  targets  for  breast  cancer. 
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Appendices: 


Figure  legends: 

Figure  1  -  Relative  abundance  of  nuclear  and  mitochondrial  encoded  tRNAs  in  breast  cancer  cells. 
Data  is  shown  for  three  breast  epithelial  cell  lines  (MCF10A,  184  Al,  184  B5)  and  six  breast  cancer 
cell  lines  (MDA-MB-231,  MCF7,  HCC70,  ZR-75-1,  MDA-MB-436,  BT-474),  all  relative  to 
MCF10A.  (A)  Mean  and  median  values  of  the  nuclear  (left)  and  mitochondrial  (right)  encoded 
tRNAs.  (B)  Total  tRNA  quantified  by  agarose  gel  electrophoresis  for  all  samples.  All  RNAs  are 
detected  by  ethidium  bromide  staining  and  quantified  using  a  PharosFX  Molecular  Imager.  Fraction 
of  total  tRNA  was  measured  relative  to  the  non-tRNA  bands  in  the  same  lane,  and  then  nonnalized 
to  that  of  MCF10A.  (C)  Expression  of  nuclear  and  mitochondrial  encoded  tRNAs  shown  as 
TreeView  image.  All  values  are  relative  to  MCF10A.  Green  indicates  a  decreased  level  of 
expression;  red  indicates  an  increased  level  of  expression  relative  to  MCF10A.  Data  are  grouped 
according  to  amino  acid  type.  (D)  Expression  of  nuclear  and  mitochondrial  encoded  tRNAs 
normalized  to  median  shown  as  TreeView  image.  All  values  are  relative  to  MCF10A  and 
normalized  to  the  median  value  for  each  sample.  Green  indicates  a  decreased  level  of  expression; 
red  indicates  an  increased  level  of  expression  relative  to  median.  Data  are  grouped  according  to 
corresponding  amino  acid  type.  (E)  Sames  as  (B),  data  are  grouped  from  high  to  low  expression. 

Figure  2  -  tRNA  over-expression  in  breast  cancer  in  vivo.  Data  is  shown  for  3  normal  breast  tissue 
samples  (A-01,  A-03,  S-23),  4  ER-/HER2-  tumor  samples  (59826,  60046,  62706,  62944),  2  ER- 
/HER2+  tumor  samples  (46258,  58955),  and  three  ER+/HER2-  tumor  samples  (41299,  57731, 
45163).  All  data  is  relative  to  MCF10A.  (A)  Mean  and  median  values  of  the  nuclear  (left)  and 
mitochondrial  (right)  encoded  tRNAs.  (B)  Expression  of  nuclear  and  mitochondrial  encoded  tRNAs 
shown  as  TreeView  image.  All  values  are  relative  to  MCF10A.  Green  indicates  a  decreased  level  of 
expression;  red  indicates  an  increased  level  of  expression  relative  to  MCF10A.  Data  are  grouped 
according  to  amino  acid  type.  (D)  Expression  of  nuclear  and  mitochondrial  encoded  tRNAs 
nonnalized  to  median  shown  as  TreeView  image.  All  values  are  relative  to  MCF10A  and 
nonnalized  to  the  median  value  for  each  sample.  Green  indicates  a  decreased  level  of  expression; 
red  indicates  an  increased  level  of  expression  relative  to  median.  Data  are  grouped  according  to 
conesponding  amino  acid  type.  (E)  Sames  as  (B),  data  are  grouped  from  high  to  low  expression. 

Figure  3  -  Analysis  of  codon  usage  versus  tRNA  over-expression.  (A)  Three  gene  groups  are 
relevant  in  this  analysis  (mRNA  expression  level  is  derived  from  signals  on  Affymetrix  mRNA 
arrays).  tRNA  expression  or  over-expression  when  comparing  cancer  and  non-tumorigenic  cells  are 
unlikely  to  positively  correlate  to  the  codon  usage  of  all  three  groups.  (B)  Codon  usage  correlation 
between  cell-line  specific  genes,  cancer-related  genes  and  housekeeping  genes.  The  degree  of 
correlation  was  assessed  using  Spearman’s  rho  (rs).  Mean  rs  values  are  plotted  for  the  following 
correlations:  cell  line  vs.  cell  line,  cell  line  vs.  housekeeping,  cancer-related  vs. cancer-related,  and 
cancer-related  vs.  housekeeping.  Error  bars  indicate  standard  deviation  from  the  mean.  (C) 
Correlation  of  relative  tRNA  levels  to  ratios  of  codon  usage  between  cancer-related  and  house¬ 
keeping  genes.  As  discussed  in  the  text,  a  positive  correlation  is  only  expected  for  the  codons  that 
are  over-represented  in  the  cancer-related  genes  (x  >2).  (D)  Correlation  of  relative  tRNA  levels  to 
ratios  of  codon  usage  between  the  top-third  (9  genes)  and  bottom-third  (9  genes)  transcribed  cell 
cycle  genes.  Again,  a  positive  correlation  is  only  expected  for  the  codons  that  are  over-represented 
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in  the  top  third  genes  (x  >1.2).  (E)  Correlation  of  relative  arginine  tRNA  isoacceptor  levels  to  the 
arginine  codon  frequency  of  cancer-related  genes. 
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Table  1.  Breast  Cell  Line  Characteristics. 

Relevant  characteristics  of  tumorigenic  and  non-tumorigenic  cell  lines  used  in  this  study  are 
summarized  here.  Estrogen  receptor  (ER),  progesterone  receptor  (PgR),  primary  tumor  type,  tissue 
source,  tumorigenicity,  and  tumor  classification  are  indicated.  This  table  is  based  on  previously 
published  data  (Lacroix  and  Leclercq,  2004;  Neve  et  al.,  2006). 


Cell  Line 

ER 

PgR 

HER2 

Tumor  Type 

Tissue  Source 

Tumorigenic 

Tumor  Classification 

MCF10A 

- 

- 

Fibrocystic  Disease 

Mammary  gland 

No 

N/A 

184  Al 

- 

- 

Normal 

Mammary  gland 

No 

N/A 

184  B5 

- 

- 

Normal 

Mammary  gland 

No 

N/A 

BT-474 

+ 

+ 

+ 

Invasive  ductal  carcinoma 

Primary 

Yes 

Luminal 

HCC70 

- 

- 

Ductal  carcinoma 

Primary 

Unknown 

Basal  A 

MCF7 

+ 

+ 

Invasive  ductal  carcinoma 

Metastasis  -  Pleural  effusion 

Yes 

Luminal 

MDA-MB-231 

- 

- 

Adenocarcinoma 

Metastasis  -  Pleural  effusion 

Yes 

Basal  B 

MDA-MB-436 

- 

- 

Adenocarcinoma 

Metastasis  -  Pleural  effusion 

Yes 

Basal  B 

ZR-75-1 

+ 

+ 

Invasive  ductal  carcinoma 

Metastasis  -  Ascites 

Yes 

Luminal 

Table  2.  Breast  Cell  Line  Doubling  Times 

Cell  proliferation  was  measured  over  four  days  using  the  Promega  CellTiter  Blue  metabolic  assay.  Cells  were 
plated  at  low  cell  density  (1 ,500  cells/well)  and  high  cell  density  (5,000  cells/well).  Doubling  times  were 
calculated  from  the  equation:  N/No  =  e(kt).  Values  are  averages  of  three  replicates. 


Doubling  time  (h) 

Low  cell  density 

High  cell  density 

MCF10A 

ND 

27  ±5 

184  A1 

31  ±5 

47  ±6 

184  B5 

21  ±5 

38  ±5 

BT-474 

21  ±5 

35  ±3 

HCC70 

27  ±4 

ND 

MCF7 

28  ±6 

15  ±  1 

MDA-MB-231 

26  ±2 

25  ±3 

MDA-MB-436 

29  ±3 

25  ±6 

ZR-75-1 

18  ±0.3 

ND 

Table  3.  Characteristics  of  breast  tumor  and  normal  breast  tissue  samples. 

Relevant  characteristics  of  breast  tumor  and  normal  breast  tissue  samples  analyzed  in  this  study  are 
summarized  here.  Race,  age,  gross  description,  diagnosis,  estrogen  receptor  (ER),  progesterone 
receptor  (PR),  and  HER2  status  are  listed  here.  For  the  breast  tumor  samples,  this  table  is  based  on 
pathology  reports  provided  by  the  University  of  Chicago  Human  Tissue  Resource  Center.  For  the 
normal  breast  tissue  samples,  this  table  is  based  on  data  sheets  provided  by  the  vendor  (see 
Materials  and  Methods). 


Sample  ID 

Race 

Age 

Gross  Description 

Diagnosis 

ER 

PR 

HER2 

59826 

Unknown  or  not  reported 

Unknown 

Invasive  Ductal  Carcinoma,  Not  Otherwise  Specified 

B 

B 

60046 

Unknown  or  not  reported 

Tumor 

Ductal  Breast  Carcinoma  in  Situ 

62706 

Unknown  or  not  reported 

48 

Tumor 

Invasive  Ductal  Carcinoma,  Not  Otherwise  Specified 

H 

H 

I^HI 

62944 

African  American  or  Black 

51 

Tumor 

Invasive  Ductal  Carcinoma,  Not  Otherwise  Specified 

46258 

Unknown  or  not  reported 

34 

Tumor 

Ductal  Breast  Carcinoma 

■ 

■ 

+ 

58955 

Indian 

75 

Tumor 

Ductal  Breast  Carcinoma 

+ 

41299 

White  or  Caucasian 

55 

Unknown 

/ 

+ 

+ 

- 

57731 

White  or  Caucasian 

59 

Tumor 

Ductal  Breast  Carcinoma  in  Situ 

+ 

+ 

- 

45163 

Unknown  or  not  reported 

62 

Tumor 

Ductal  Breast  Carcinoma  in  Situ 

+ 

- 

+ 

A-01 

White  or  Caucasian 

Normal 

/ 

A-06 

White  or  Caucasian 

Normal 

/ 

■ 

■ 

S-23 

White  or  Caucasian 

41 

Normal 

Fibrocystic  change 

Table  4  -  Codon  usage  compilation  of  breast  cancer  cell  line-specific  genes. 

a.  To  define  highly-expressed  genes  in  breast  cancer  cell  lines,  we  set  a  threshold  of  2.6-  to  2.8-fold  above 
the  median  expression  value  for  all  transcripts. 

b.  The  top  20  to  30  transcripts  based  on  cell  line/MCFI  0A  expression  ratio  were  selected  for  codon  usage- 
tRNA  correlation  analysis.  The  median  expression  level  for  these  transcripts  is  indicated  here,  relative  to 
the  median  value  for  all  transcripts. 


Cell  Line- 
Specific  Genes 

Threshold3 
(fold  over  median) 

Median  expression 
levelb  (fold  over  median) 

Number  of 
genes 

Number  of  codons 
compiled 

MDA-MB-231 

2.6 

7.0 

26 

36468 

MCF7 

2.7 

8.2 

26 

26931 

HCC70 

2.8 

14.8 

24 

47364 

ZR-75-1 

2.6 

11.0 

26 

28950 

MDA-MB-436 

2.8 

7.5 

27 

31494 

BT-474 

2.6 

14.6 

22 

38133 

Table  5  -  Codon  usage  compilation  for  house-keeping  and  cancer-related  genes. 


Cancer-related  genes 

Number  of  genes 

Number  of  codons  compiled 

Housekeeping 

29 

19835 

Cell  cycle 

28 

56850 

Cell  growth  and  proliferation 

35 

60831 

ECM  molecules 

21 

52353 

Protein  kinases 

17 

45882 

Transcription  factors  and  regulators 

16 

40893 
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